Multi - Devices Hindi Speech Database for Speaker Identification using GMM

نویسندگان

Sonu Kumar

Mahesh Chandra

چکیده

Abstract— In this paper, we study the effect on speaker identification (SI) system when speech data is recorded on two different sensors, a HP Pavilion third generation laptop and a Samsung mobile ( S3770K) both with built-in microphone in parallel in a closed room in noise free environment. The database contains 10 Hindi sentences (50-60 seconds speech) and one english sentence (7-8 seconds speech) of each 39 speakers (26 Male and 13 Female) in a reading style manner. Identification process adopts the methods of feature extraction based on Mel-frequency cepstrum coefficients (MFCC), linear predictive coding (LPC) coefficients. Gaussian mixture model (GMM) is used as a classifier. Our study shows that higher degradation in performance in case of mismatch of sensors during training and testing of data and MFCC performs better during matched conditions, LPC performs better than MFCC in mismatched conditions .

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Analysis of Speaker Identification System Using GMM with VQ

Personal identity identification is an important requirement for controlling access to protected resources. Biometric identification by using certain features of a person is a more secured solution for security identification. Advances in speech processing technology and digital signal processors have made possible the design of high-performance and practical speaker recognition systems. A more...

متن کامل

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

متن کامل

GMM based clustering and speaker separability in the Timit speech database

Speaker recognition on the 630 speaker Timit speech database, using maximum probability selection with a simple Gaussian Mixture Model (GMM) for the data distribution for each speaker, gives above 99% correct recognition. In contrast, a powerful classifier such as a Multi Layer Perceptron (MLP), trained to estimate speaker probabilities, even on a small subset of speakers often performs no bett...

متن کامل

Speaker identification using Gaussian mixture models based on multi-space probability distribution

This paper presents a new approach to modeling speech spectra and pitch for text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution (MSD-GMM). The MSD-GMM allows us to model continuous pitch values for voiced frames and discrete symbols representing unvoiced frames in a unified framework. Spectral and pitch features are jointly modeled...

متن کامل

Speaker Identification From Youtube Obtained Data

An efficient, and intuitive algorithm is presented for the identification of speakers from a long dataset (like YouTube long discussion, Cocktail party recorded audio or video).The goal of automatic speaker identification is to identify the number of different speakers and prepare a model for that speaker by extraction, characterization and speaker-specific information contained in the speech s...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Multi - Devices Hindi Speech Database for Speaker Identification using GMM

نویسندگان

چکیده

منابع مشابه

Performance Analysis of Speaker Identification System Using GMM with VQ

A Comparative Study of Gender and Age Classification in Speech Signals

GMM based clustering and speaker separability in the Timit speech database

Speaker identification using Gaussian mixture models based on multi-space probability distribution

Speaker Identification From Youtube Obtained Data

عنوان ژورنال:

اشتراک گذاری